Abstract

We study numerical approximations of integrals $\int_{[0,1]^s} f(x)\,dx$ by averaging the function at some sampling points. Monte Carlo (MC) sampling yields a convergence of the root mean square error (RMSE) of order $N^{-1/2}$ (where $N$ is the number of samples). Quasi-Monte Carlo (QMC) sampling on the other hand achieves a convergence of order $N^{-1+\varepsilon}$ for any $\varepsilon > 0$. Randomized QMC (RQMC), a combination of MC and QMC, achieves a RMSE of order $N^{-3/2+\varepsilon}$. A combination of RQMC with local antithetic sampling achieves a convergence of the RMSE of order $N^{-3/2-1/s+\varepsilon}$ (where $s \ge 1$ is the dimension). QMC, RQMC and RQMC with local antithetic sampling require that the integrand has some smoothness (for instance, bounded variation). Stronger smoothness assumptions on the integrand do not improve the convergence of the above algorithms further.

This paper introduces a new RQMC algorithm, for which we prove that it achieves a convergence of the RMSE of order $N^{-\alpha-1/2+\varepsilon}$ if the integrand has square integrable partial mixed derivatives up to order $\alpha$ in each variable. Known lower bounds show that this rate of convergence cannot be improved. We provide numerical examples for which the RMSE converges approximately with order $N^{-5/2}$ and $N^{-7/2}$, in accordance with the theoretical upper bound.
1 Introduction

The problem of numerically integrating a function comes up frequently in science and engineering. We consider the standardized problem of approximating the integral $\int_{[0,1]^s} f(x)\,dx$, that is, we assume that any transformations necessary to change from different domains and density functions have already been carried out. Monte Carlo algorithms use i.i.d. uniformly distributed samples $x_1, \ldots, x_N \in [0,1]^s$ to approximate the integral by $\frac{1}{N}\sum_{n=1}^{N} f(x_n)$. For functions $f \in L_2([0,1]^s)$ the Monte Carlo method has a root mean square error (RMSE) of $O(N^{-1/2})$. An alternative to Monte Carlo is quasi-Monte Carlo. In this method one designs sample points which are more uniformly distributed with respect to some criterion (in one dimension this criterion is the Kolmogorov-Smirnov distance between the uniform distribution and the sample point distribution). These achieve a worst case error which decays with $O(N^{-1+\varepsilon})$ for any $\varepsilon > 0$, see [7]. Owen [15, 16, 17] introduced a randomization of QMC which achieves a RMSE of $O(N^{-3/2+\varepsilon})$. Owen's randomization method, called scrambling, uses permutations applied to digital nets (a construction scheme for sample points used in quasi-Monte Carlo). A slight improvement of Owen's scrambling method of digital nets can be obtained by combining this approach with local antithetic sampling, see [19]. Therein it was shown that one obtains a convergence of the RMSE of $O(N^{-3/2-1/s+\varepsilon})$ ($s$ is the dimension of the domain). The latter three methods require that the function $f$ has some smoothness (for instance continuous partial mixed derivatives up to order 1 in each coordinate in the first two methods and continuous partial mixed derivatives up to order 2 in each coordinate in the third method). No further improvement on the rate of convergence is obtained when one assumes that the integrand has continuous higher order partial mixed derivatives in each variable.

In this paper we introduce a randomization of quasi-Monte Carlo algorithms (which use digital nets as quadrature points) such that the RMSE converges with $O(N^{-\alpha-1/2+\varepsilon})$ (for any $\varepsilon > 0$) if the integrand has square integrable partial mixed derivatives up to order $\alpha$ in each variable. This result holds for any $\alpha \ge 1$ and it is known that this result is best possible, see

m- 

For the reader familiar with scrambled digital nets, we briefly describe the algorithm. The 
details on scrambled digital nets will be given in the next section. 

1.1 The algorithm 

The underlying idea of the new randomized QMC algorithm stems from [4, 5]. Central to this method is the digit interlacing function with interlacing factor $d \in \mathbb{N}$ given by

$$\mathscr{D}_d : [0,1)^d \to [0,1), \qquad (x_1, \ldots, x_d) \mapsto \sum_{a=1}^{\infty} \sum_{r=1}^{d} \xi_{r,a}\, b^{-r-(a-1)d},$$

where $x_r = \xi_{r,1} b^{-1} + \xi_{r,2} b^{-2} + \cdots$ for $1 \le r \le d$. Let $x_0, \ldots, x_{b^m-1} \in [0,1)^{ds}$ be a randomly scrambled digital $(t, m, ds)$-net over the finite field $\mathbb{Z}_b$ of prime order $b$ (we present the theoretical background on scrambled digital nets in the next section). Let $x_n = (x_{n,1}, \ldots, x_{n,ds})$. Then one simply uses the sample points

$$y_n = (\mathscr{D}_d(x_{n,1}, \ldots, x_{n,d}), \ldots, \mathscr{D}_d(x_{n,d(s-1)+1}, \ldots, x_{n,ds})) \in [0,1)^s \quad \text{for } 0 \le n < b^m.$$

The integral is then estimated using

$$\hat{I}(f) = \frac{1}{b^m} \sum_{n=0}^{b^m-1} f(y_n).$$

In Theorem 8 we show that if the integrand has square integrable partial mixed derivatives of order $\alpha \ge 1$ in each variable, then the variance of $\hat{I}(f)$ satisfies

$$\mathrm{Var}[\hat{I}(f)] = O(N^{-2\min(d,\alpha)-1+\varepsilon})$$

for any $\varepsilon > 0$, where $N = b^m$ is the number of sample points.

Since scrambled digital nets (based on Sobol points) are included in the statistics toolbox of Matlab, this method is very easy to implement (an implementation can be found at http://quasirandomideas.wordpress.com).
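The interlacing step of the algorithm can be sketched in a few lines. The following is a minimal illustration, not the implementation referred to above: the function names `digits`, `interlace`, `interlaced_points` and `estimate` are hypothetical, the digit expansions are truncated after `n` digits (an assumption made for the sketch), and the scrambled digital net itself is taken as given input rather than generated here.

```python
from fractions import Fraction


def digits(x, b, n):
    """Return the first n base-b digits of x in [0, 1)."""
    out = []
    x = Fraction(x)              # exact arithmetic avoids rounding in the digits
    for _ in range(n):
        x *= b
        q = int(x)               # the integer part is the next digit
        out.append(q)
        x -= q
    return out


def interlace(xs, b=2, n=16):
    """Digit interlacing of (x_1, ..., x_d), truncated to n digits per input.

    Digit a of coordinate r is placed at position r + (a - 1) * d.
    """
    d = len(xs)
    dig = [digits(x, b, n) for x in xs]
    total = Fraction(0)
    for a in range(1, n + 1):        # a-th digit of each coordinate
        for r in range(1, d + 1):    # coordinate index
            total += Fraction(dig[r - 1][a - 1], b ** (r + (a - 1) * d))
    return float(total)


def interlaced_points(points, d, b=2, n=16):
    """Map points in [0,1)^(d*s) to [0,1)^s by interlacing blocks of d coordinates."""
    return [tuple(interlace(p[i:i + d], b, n) for i in range(0, len(p), d))
            for p in points]


def estimate(f, points, d, b=2, n=16):
    """Average f over the interlaced points (the input points are assumed
    to come from a scrambled digital (t, m, ds)-net generated elsewhere)."""
    ys = interlaced_points(points, d, b, n)
    return sum(f(*y) for y in ys) / len(ys)
```

For example, in base 2 the pair $(0.75, 0.5) = (0.11_2, 0.10_2)$ is interlaced to $0.111_2 = 0.875$, and for $d = 1$ the map is the identity on the retained digits.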

1.2 Numerical Results 

Before we introduce the theoretical background, we present some simple numerical results 
which verify the convergence results. 

Example 1 



In this example the dimension is 1 and the integrand is given by $f(x) = x e^x$. Figure 1 shows the RMSE from 300 independent replications. Here, the straight lines show the functions $N^{-3/2}$, $N^{-5/2}$ and $N^{-7/2}$. The other lines are the RMSE where the digit interlacing factor $d$ is given by 1 for the upper dashed line, 2 for the dashed line in the middle and 3 for the lowest of the dashed lines. Figure 1 shows that in each case the RMSE converges approximately with order $N^{-d-1/2}$ (for large enough $N$). (The result for $d = 1$ appears to perform even better than $N^{-3/2}$.)
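The way such an experiment is evaluated can be sketched as follows. This is only an illustration of the RMSE computation from independent replications and of reading off an empirical convergence order from a log-log fit; the estimator used here is plain Monte Carlo as a stand-in (so the observed order is about $-1/2$, not the rates of the scrambled nets), and the helper names are hypothetical. The exact value of the integral of $x e^x$ over $[0,1]$ is 1.

```python
import math
import random


def f(x):
    return x * math.exp(x)       # integrand of Example 1, exact integral is 1


def mc_estimate(n, rng):
    """Plain Monte Carlo average with n i.i.d. uniform samples (stand-in estimator)."""
    return sum(f(rng.random()) for _ in range(n)) / n


def rmse(estimator, exact, replications):
    """Root mean square error over independent replications of the estimator."""
    sq = [(estimator() - exact) ** 2 for _ in range(replications)]
    return math.sqrt(sum(sq) / replications)


def observed_order(ns, errs):
    """Least squares slope of log(error) against log(N)."""
    lx = [math.log(n) for n in ns]
    ly = [math.log(e) for e in errs]
    mx, my = sum(lx) / len(lx), sum(ly) / len(ly)
    num = sum((x - mx) * (y - my) for x, y in zip(lx, ly))
    den = sum((x - mx) ** 2 for x in lx)
    return num / den


rng = random.Random(0)
ns = [2 ** m for m in range(4, 11)]
errs = [rmse(lambda n=n: mc_estimate(n, rng), 1.0, 300) for n in ns]
order = observed_order(ns, errs)
print(ns, errs, order)
```

Replacing `mc_estimate` by an estimator based on interlaced scrambled nets would, under the assumptions of the paper, be expected to show slopes close to $-d-1/2$ instead.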




Figure 1: The green lines show $N^{-3/2}$ and the standard deviation where $d = 1$, the blue lines show $N^{-5/2}$ and the standard deviation where $d = 2$ and the red lines show $N^{-7/2}$ and the standard deviation where $d = 3$.



Example 2 



We consider now a 2 dimensional example where the integrand is given by $f(x,y) = \frac{y e^{xy}}{e-2}$.
This function was also used in [TU] where the sample points are obtained by scrambling and 
local antithetic sampling. 



Figure 2 shows again the RMSE for 300 independent replications. The straight lines show the functions $N^{-3/2}$ and $N^{-5/2}$. The two dashed lines show the RMSE when $d = 1$ (upper dashed line) and when $d = 2$ (lower dashed line). Figure 2 shows that in each case the RMSE converges approximately with order $N^{-d-1/2}$ (for large enough $N$).

Figure 2: The green lines show $N^{-3/2}$ and the standard deviation where $d = 1$, the blue lines show $N^{-5/2}$ and the standard deviation where $d = 2$.

In the following section we give the necessary background on QMC, digital nets, scrambling and Walsh functions. We then prove in Section 3 what can be observed from the numerical results, namely, that if the integrand has square integrable partial mixed derivatives of order $\alpha$ in each variable, then we obtain a convergence of the RMSE of $O(N^{-\min(\alpha,d)-1/2+\varepsilon})$ for any $\varepsilon > 0$. A short discussion of the results is presented in Section 4. Some properties of the digit interlacing function $\mathscr{D}_d$ necessary for the proof are presented in Appendix A and a technical proof on the convergence of the Walsh coefficients is presented in Appendix B.

2 Background and notation 

In this section we give the necessary background on QMC methods. Some notation is required, 
which we now present. 

In this section, $c, C > 0$ stand for generic constants which may differ in different places.

Throughout the paper we assume that $b \ge 2$ is a prime number. We always have k =

[Ki, . . . , kg), k = [ki, . . . , kg), X = [Xi, . . . , Xg), y = [yi, . . . ,yg), Xn = {Xn,l, • • • , Xn^s)} Vn ~ 



2.1 Quasi-Monte Carlo 

Quasi-Monte Carlo algorithms $\hat{I}(f) = \frac{1}{N} \sum_{n=0}^{N-1} f(x_n)$ are used to approximate integrals $I(f) = \int_{[0,1]^s} f(x)\,dx$. The difference to Monte Carlo is the method by which the sample points $x_0, \ldots, x_{N-1} \in [0,1)^s$ are chosen. The aim of QMC is to choose those points such that the integration error

$$\left| \int_{[0,1]^s} f(x)\,dx - \frac{1}{N} \sum_{n=0}^{N-1} f(x_n) \right|$$

achieves the (almost) optimal rate of convergence as $N \to \infty$ for a class of functions $f : [0,1]^s \to \mathbb{R}$. For instance, for the set of all such functions $f$ which have bounded variation in the sense of Hardy and Krause, which we write as $\|f\|_{HK} < \infty$, it is known that the best rate of convergence for the worst case error is

$$e = \sup_{f : \|f\|_{HK} < \infty} \frac{\left| \int_{[0,1]^s} f(x)\,dx - \frac{1}{N} \sum_{n=0}^{N-1} f(x_n) \right|}{\|f\|_{HK}} \approx N^{-1+\delta} \quad \text{for all } \delta > 0.$$

(More precisely, there are constants $c, C > 0$ such that $c N^{-1} (\log N)^{(s-1)/2} \le e \le C N^{-1} (\log N)^{s-1}$, see [7].)

Choosing the points $x_0, \ldots, x_{N-1} \in [0,1)^s$ randomly as in MC does not yield this rate of convergence. Even if a function has bounded variation in the sense of Hardy and Krause one obtains only a convergence of order $N^{-1/2}$ for randomly chosen sample points.

There is an explicit construction of the sample points $x_0, \ldots, x_{N-1}$ for which the optimal rate of convergence is achieved. The essential insight is that the quadrature points need to be more uniformly distributed than what one obtains by choosing the sample points by chance. One criterion for how uniformly a set of points $P_N = \{x_0, \ldots, x_{N-1}\}$ is distributed is the star discrepancy

$$D_N^*(P_N) = \sup_{z \in [0,1]^s} \left| \frac{1}{N} \sum_{n=0}^{N-1} 1_{[0,z)}(x_n) - \mathrm{Vol}([0,z)) \right|,$$

where $[0,z) = \prod_{i=1}^{s} [0, z_i)$ with $z = (z_1, \ldots, z_s)$, $\mathrm{Vol}([0,z)) = \prod_{i=1}^{s} z_i$ is the volume of $[0,z)$ and

$$1_{[0,z)}(x_n) = \begin{cases} 1 & \text{if } x_n \in [0,z), \\ 0 & \text{otherwise.} \end{cases}$$

When $s = 1$ this becomes the Kolmogorov-Smirnov distance between the empirical distribution of the points and the uniform distribution. Further we call

$$\delta_{P_N}(z) = \frac{1}{N} \sum_{n=0}^{N-1} 1_{[0,z)}(x_n) - \mathrm{Vol}([0,z))$$

the local discrepancy (of $P_N$).

The connection of this criterion to the integration error is given by the Koksma-Hlawka inequality

$$\left| \int_{[0,1]^s} f(x)\,dx - \frac{1}{N} \sum_{n=0}^{N-1} f(x_n) \right| \le D_N^*(P_N)\, \|f\|_{HK}.$$

An explicit construction of point sets $P_N = \{x_0, \ldots, x_{N-1}\} \subset [0,1)^s$ for which $D_N^*(P_N) \le C N^{-1} (\log N)^{s-1}$ is given by the concept of digital nets, which we introduce in the next subsection. Notice that for such a point set, the Koksma-Hlawka inequality implies the optimal rate of convergence of the integration error, since for a given integrand, the variation $\|f\|_{HK}$ does not depend on $P_N$ and $N$.
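For $s = 1$ the star discrepancy can be computed exactly from the sorted points, which gives a concrete feeling for the criterion. The sketch below is an illustration under that one-dimensional restriction; the function names are hypothetical, and the van der Corput sequence (the radical inverse in base $b$) is used only as a familiar example of a well distributed point set.

```python
def star_discrepancy_1d(points):
    """Exact star discrepancy of a finite point set in [0, 1).

    For sorted points x_(1) <= ... <= x_(N) the supremum over boxes [0, z)
    is attained at or just above one of the points, which gives
    max_i max(i/N - x_(i), x_(i) - (i-1)/N).
    """
    xs = sorted(points)
    n = len(xs)
    return max(max((i + 1) / n - x, x - i / n) for i, x in enumerate(xs))


def van_der_corput(n, b=2):
    """Radical inverse of the integer n in base b."""
    x, scale = 0.0, 1.0 / b
    while n:
        n, digit = divmod(n, b)
        x += digit * scale
        scale /= b
    return x


vdc = [van_der_corput(n) for n in range(8)]
print(star_discrepancy_1d(vdc))          # first 8 van der Corput points
print(star_discrepancy_1d([0.5]))        # a single point at 1/2
```

The first eight van der Corput points in base 2 are the grid $\{0, 1/8, \ldots, 7/8\}$, whose star discrepancy is $1/8$, while the centered grid $\{(2i+1)/(2N)\}$ attains the smallest possible value $1/(2N)$.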

2.2 Digital nets 

A comprehensive introduction to digital nets can be found in [TJ [13]. 

The aim is to construct a point set $P_N = \{x_0, \ldots, x_{N-1}\}$ such that the star discrepancy satisfies $D_N^*(P_N) \le C N^{-1} (\log N)^{s-1}$. To do so, we discretize the problem by choosing the point set $P_N$ such that the local discrepancy $\delta_{P_N}(z) = 0$ for certain $z \in [0,1]^s$ (those $z$ in turn are chosen such that the star discrepancy of $P_N$ is small, as we explain below).

It turns out that, when one chooses a base $b \ge 2$ and $N = b^m$, then for every natural number $m$ there exist point sets $P_{b^m} = \{x_0, \ldots, x_{b^m-1}\}$ such that $\delta_{P_{b^m}}(z) = 0$ for all $z = (z_1, \ldots, z_s)$ of the form

$$z_i = \frac{a_i}{b^{d_i}} \quad \text{for } 1 \le i \le s,$$

where $0 < a_i \le b^{d_i}$ is an integer and $d_1 + \cdots + d_s \le m - t$ with $d_1, \ldots, d_s \ge 0$. Crucially, the value of $t$ can be chosen independently of $m$ (but depends on $s$). A point set $P_N$ which satisfies this property is called a $(t, m, s)$-net in base $b$. An equivalent description of $(t, m, s)$-nets in base $b$ is given in the following definition.

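Definition 1 Let $b \ge 2$, $m, s \ge 1$ and $t \ge 0$ be integers. A point set $P_{b^m} = \{x_0, \ldots, x_{b^m-1}\} \subset [0,1)^s$ is called a $(t, m, s)$-net in base $b$, if for all nonnegative integers $d_1, \ldots, d_s$ with $d_1 + \cdots + d_s \le m - t$, the elementary interval

$$\prod_{i=1}^{s} \left[ \frac{a_i}{b^{d_i}}, \frac{a_i + 1}{b^{d_i}} \right)$$

contains exactly $b^{m - d_1 - \cdots - d_s}$ points of $P_{b^m}$ for all integers $0 \le a_i < b^{d_i}$.

It can be shown that a $(t, m, s)$-net in base $b$ satisfies

$$D_{b^m}^*(P_{b^m}) \le C\, \frac{m^{s-1}}{b^{m-t}},$$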

see [U 113] for details. 

Explicit constructions of $(t, m, s)$-nets can be obtained using the digital construction scheme. Such point sets are then called digital nets (or digital $(t, m, s)$-nets if the point set is a $(t, m, s)$-net).

To describe the digital construction scheme, let $b$ be a prime number and let $\mathbb{Z}_b$ be the finite field of order $b$ (a prime power and the finite field $\mathbb{F}_b$ could be used as well). Let $C_1, \ldots, C_s \in \mathbb{Z}_b^{dm \times m}$ be $s$ matrices of size $dm \times m$ with elements in $\mathbb{Z}_b$ and $d \in \mathbb{N}$. The $i$th coordinate $x_{n,i}$ of the $n$th point $x_n = (x_{n,1}, \ldots, x_{n,s})$ of the digital net is obtained in the following way. For $0 \le n < b^m$ let $n = n_0 + n_1 b + \cdots + n_{m-1} b^{m-1}$ be the base $b$ representation of $n$. Let $\vec{n} = (n_0, \ldots, n_{m-1})^\top \in \mathbb{Z}_b^m$ denote the vector of digits of $n$. Then let

$$\vec{y}_{n,i} = C_i \vec{n}.$$

For $\vec{y}_{n,i} = (y_{n,i,1}, \ldots, y_{n,i,dm})^\top \in \mathbb{Z}_b^{dm}$ we set

$$x_{n,i} = \frac{y_{n,i,1}}{b} + \cdots + \frac{y_{n,i,dm}}{b^{dm}}.$$

The construction described here is slightly more general than the classical concept to suit our needs (the classical construction scheme uses $d = 1$). In this framework we have that if $\{x_0, \ldots, x_{b^m-1}\}$ is a digital $(t, m, ds)$-net, then $\{\mathscr{D}_d(x_0), \ldots, \mathscr{D}_d(x_{b^m-1})\}$ is a digital $(t, m, s)$-net.

The search for $(t, m, s)$-nets has now been reduced to finding suitable matrices $C_1, \ldots, C_s$.
Explicit constructions of such matrices are available, see [H US] . 
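The digital construction scheme and the net property of Definition 1 can be checked by brute force on a small example. The sketch below is an illustration only: the function names are hypothetical, and the two generating matrices (the identity and the identity with its rows reversed, which give the two-dimensional Hammersley point set in base 2) are chosen here as a simple known example of a $(0, m, 2)$-net, not taken from the constructions cited in the text.

```python
from fractions import Fraction
from itertools import product


def digital_net(matrices, b, m):
    """Points of the digital net generated by the given matrices over Z_b.

    Each matrix is a list of rows; every row has m entries in {0, ..., b-1}.
    """
    points = []
    for n in range(b ** m):
        nd = [(n // b ** j) % b for j in range(m)]      # digits n_0, ..., n_{m-1}
        point = []
        for C in matrices:
            y = [sum(c * v for c, v in zip(row, nd)) % b for row in C]
            point.append(sum(Fraction(yj, b ** (j + 1)) for j, yj in enumerate(y)))
        points.append(tuple(point))
    return points


def is_net(points, t, m, s, b):
    """Brute-force check: every elementary interval with d_1 + ... + d_s <= m - t
    must contain exactly b^(m - d_1 - ... - d_s) points."""
    for ds in product(range(m - t + 1), repeat=s):
        if sum(ds) > m - t:
            continue
        counts = {}
        for p in points:
            key = tuple(int(p[i] * b ** ds[i]) for i in range(s))
            counts[key] = counts.get(key, 0) + 1
        expected = b ** (m - sum(ds))
        if len(counts) != b ** sum(ds) or any(c != expected for c in counts.values()):
            return False
    return True


m = 4
identity = [[int(i == j) for j in range(m)] for i in range(m)]
reversed_identity = identity[::-1]
net = digital_net([identity, reversed_identity], b=2, m=m)
diagonal = digital_net([identity, identity], b=2, m=m)
print(is_net(net, t=0, m=m, s=2, b=2), is_net(diagonal, t=0, m=m, s=2, b=2))
```

Using the same matrix twice puts all points on the diagonal, which fails the check for $t = 0$; any point set of $b^m$ points trivially passes it for $t = m$.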

2.3 Walsh functions 

To analyze the RMSE we use the Walsh series expansions of the integrands. In this subsection 
we recall some basic properties of Walsh functions used in this paper. First we give the 
definition for the one-dimensional case. 

Definition 2 Let $b \ge 2$ be an integer and represent $k \in \mathbb{N}_0$ in base $b$, $k = \kappa_{a-1} b^{a-1} + \cdots + \kappa_0$, with $\kappa_i \in \{0, \ldots, b-1\}$. Further let $\omega_b = e^{2\pi \mathrm{i}/b}$. Then the $k$th Walsh function ${}_b\mathrm{wal}_k : [0,1) \to \{1, \omega_b, \ldots, \omega_b^{b-1}\}$ in base $b$ is given by

$${}_b\mathrm{wal}_k(x) = \omega_b^{x_1 \kappa_0 + \cdots + x_a \kappa_{a-1}},$$

for $x \in [0,1)$ with base $b$ representation $x = x_1 b^{-1} + x_2 b^{-2} + \cdots$ (unique in the sense that infinitely many of the $x_i$ are different from $b - 1$).

We now extend this definition to the multi-dimensional case.

Definition 3 For dimension $s \ge 2$, $x = (x_1, \ldots, x_s) \in [0,1)^s$, and $k = (k_1, \ldots, k_s) \in \mathbb{N}_0^s$, we define ${}_b\mathrm{wal}_k : [0,1)^s \to \{1, \omega_b, \ldots, \omega_b^{b-1}\}$ by

$${}_b\mathrm{wal}_k(x) = \prod_{j=1}^{s} {}_b\mathrm{wal}_{k_j}(x_j).$$

As can be seen from the definition, Walsh functions are piecewise constant. For $b = 2$ they are also related to Haar functions.
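Definitions 2 and 3 translate directly into code. The following sketch is an illustration with hypothetical function names; it evaluates Walsh functions from the digit expansions and checks orthonormality numerically by averaging over the midpoints of the intervals of length $b^{-\text{digits}}$, which is exact (up to rounding) for indices below $b^{\text{digits}}$ because the functions are constant on those intervals.

```python
import cmath


def base_digits_int(k, b):
    """Digits kappa_0, kappa_1, ... of the nonnegative integer k in base b."""
    out = []
    while k:
        k, r = divmod(k, b)
        out.append(r)
    return out


def walsh(k, x, b=2):
    """k-th Walsh function in base b at x in [0, 1), following Definition 2."""
    exponent = 0
    for kappa in base_digits_int(k, b):
        x *= b
        xi = int(x)              # next base-b digit of x
        x -= xi
        exponent += kappa * xi   # kappa_0 pairs with x_1, kappa_1 with x_2, ...
    return cmath.exp(2j * cmath.pi * (exponent % b) / b)


def walsh_multi(ks, xs, b=2):
    """Multi-dimensional Walsh function as a product over coordinates (Definition 3)."""
    value = 1
    for k, x in zip(ks, xs):
        value *= walsh(k, x, b)
    return value


def inner_product(k, l, b=2, digits=4):
    """Integral of wal_k times the conjugate of wal_l, for k, l < b**digits."""
    n = b ** digits
    return sum(walsh(k, (j + 0.5) / n, b) * walsh(l, (j + 0.5) / n, b).conjugate()
               for j in range(n)) / n


print(walsh(1, 0.25), walsh(1, 0.75))
print(abs(inner_product(3, 3)), abs(inner_product(3, 5)))
```

In base 2 the function with index 1 is $+1$ on $[0, 1/2)$ and $-1$ on $[1/2, 1)$, which the first printed line reproduces.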

We need some notation to introduce some further properties of Walsh functions. By $\oplus$ we denote the digitwise addition modulo $b$, i.e., for $x, y \in [0,1)$ with base $b$ expansions $x = \sum_{i=1}^{\infty} x_i b^{-i}$ and $y = \sum_{i=1}^{\infty} y_i b^{-i}$ we define

$$x \oplus y = \sum_{i=1}^{\infty} z_i b^{-i},$$

where $z_i \in \{0, \ldots, b-1\}$ is given by $z_i \equiv x_i + y_i \pmod{b}$, and let $\ominus$ denote the digitwise subtraction modulo $b$. In the same manner we also define a digitwise addition and digitwise subtraction for nonnegative integers based on the $b$-adic expansion. For vectors in $[0,1)^s$ or $\mathbb{N}_0^s$, the operators $\oplus$ and $\ominus$ are carried out componentwise. Throughout this paper, we always use base $b$ for the operations $\oplus$ and $\ominus$. Further we call $x \in [0,1)$ a $b$-adic rational if it can be written in a finite base $b$ expansion. In the following proposition, we summarize some basic properties of Walsh functions.

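Proposition 1

1. For all $k, l \in \mathbb{N}_0$ and all $x, y \in [0,1)$, with the restriction that if $x, y$ are not $b$-adic rationals, then $x \oplus y$ is not allowed to be a $b$-adic rational, we have

$${}_b\mathrm{wal}_k(x) \cdot {}_b\mathrm{wal}_l(x) = {}_b\mathrm{wal}_{k \oplus l}(x), \qquad {}_b\mathrm{wal}_k(x) \cdot {}_b\mathrm{wal}_k(y) = {}_b\mathrm{wal}_k(x \oplus y).$$

2. We have

$$\int_0^1 {}_b\mathrm{wal}_0(x)\,dx = 1 \quad \text{and} \quad \int_0^1 {}_b\mathrm{wal}_k(x)\,dx = 0 \text{ if } k > 0.$$

3. For all $k, l \in \mathbb{N}_0^s$ we have the following orthogonality properties:

$$\int_{[0,1)^s} {}_b\mathrm{wal}_k(x)\, \overline{{}_b\mathrm{wal}_l(x)}\,dx = \begin{cases} 1 & \text{if } k = l, \\ 0 & \text{otherwise.} \end{cases}$$

4. For any $f \in L_2([0,1)^s)$ and any $\sigma \in [0,1)^s$ we have

$$\int_{[0,1)^s} f(x \oplus \sigma)\,dx = \int_{[0,1)^s} f(x)\,dx.$$

5. For $s \in \mathbb{N}$, the system $\{{}_b\mathrm{wal}_k : k = (k_1, \ldots, k_s),\ k_1, \ldots, k_s \ge 0\}$ is a complete orthonormal system in $L_2([0,1)^s)$.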

The proofs of 1. — 3. are straightforward, and for a proof of the remaining items see |3] or 
[TJ [21] for more information. 

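Let $d \ge 1$ and $k_1, \ldots, k_d \in \mathbb{N}_0$. Let $k_r = \kappa_{r,0} + \kappa_{r,1} b + \cdots$, where $\kappa_{r,a} \in \{0, \ldots, b-1\}$ and $\kappa_{r,a} = 0$ for $a$ large enough. To analyze the RMSE, it is convenient to extend the digit interlacing function $\mathscr{D}_d$ to

$$\mathscr{E}_d : \mathbb{N}_0^d \to \mathbb{N}_0, \qquad (k_1, \ldots, k_d) \mapsto \sum_{a=0}^{\infty} \sum_{r=1}^{d} \kappa_{r,a}\, b^{r-1+ad}.$$

Then we have

$${}_b\mathrm{wal}_{\mathscr{E}_d(k_1, \ldots, k_d)}(\mathscr{D}_d(x_1, \ldots, x_d)) = \prod_{i=1}^{d} {}_b\mathrm{wal}_{k_i}(x_i).$$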

2.4 Scrambling 

The scrambling algorithm which yields the optimal rate of convergence of the RMSE uses the digit interlacing function and the scrambling introduced by Owen [15, 16, 17], which we describe in the following.

2.4.1 Owen's scrambling 

Owen's scrambling algorithm is easiest described for some generic point $x \in [0,1)^s$, with $x = (x_1, \ldots, x_s)$ and $x_i = \xi_{i,1} b^{-1} + \xi_{i,2} b^{-2} + \cdots$. The scrambled point shall be denoted by $y \in [0,1)^s$, where $y = (y_1, \ldots, y_s)$ and $y_i = \eta_{i,1} b^{-1} + \eta_{i,2} b^{-2} + \cdots$. The point $y$ is obtained by applying permutations to each digit of each coordinate of $x$. The permutation applied to $\xi_{i,l}$ depends on $\xi_{i,k}$ for $1 \le k < l$. Specifically, $\eta_{i,1} = \pi_i(\xi_{i,1})$, $\eta_{i,2} = \pi_{i,\xi_{i,1}}(\xi_{i,2})$, $\eta_{i,3} = \pi_{i,\xi_{i,1},\xi_{i,2}}(\xi_{i,3})$, and in general

$$\eta_{i,k} = \pi_{i,\xi_{i,1},\ldots,\xi_{i,k-1}}(\xi_{i,k}), \qquad (1)$$

where $\pi_{i,\xi_{i,1},\ldots,\xi_{i,k-1}}$ is a random permutation of $\{0, \ldots, b-1\}$. We assume that permutations with different indices are chosen mutually independent from each other and that each permutation is chosen with the same probability.

To describe Owen's scrambling, for $1 \le i \le s$ let

$$\Pi_i = \{\pi_{i,\xi_{i,1},\ldots,\xi_{i,k-1}} : k \in \mathbb{N},\ \xi_{i,1}, \ldots, \xi_{i,k-1} \in \{0, \ldots, b-1\}\},$$

where for $k = 1$ we set $\pi_{i,\xi_{i,1},\ldots,\xi_{i,k-1}} = \pi_i$, be a given set of permutations and let $\Pi = (\Pi_1, \ldots, \Pi_s)$. Then, when applying Owen's scrambling using these permutations to some point $x \in [0,1)^s$, we write $y = x_\Pi$, where $y$ is the point obtained by applying Owen's scrambling to $x$ using the set of permutations $\Pi = (\Pi_1, \ldots, \Pi_s)$. For $x \in [0,1)$ we drop the subscript $i$ and just write $y = x_\Pi$.

To analyze the RMSE it is also convenient to generalize Owen's scrambling to higher order. 

2.4.2 Owen's scrambling of order d 

We now describe what we mean by Owen's scrambling of order $d \ge 1$ for a generic point $x \in [0,1)^s$. The scrambled point $y \in [0,1)^s$ is given by

$$y = \mathscr{D}_d((\mathscr{D}_d^{-1}(x))_\Pi),$$

that is, one applies the inverse mapping $\mathscr{D}_d^{-1}$ (see Appendix A for more information on $\mathscr{D}_d$) to the point $x$ to obtain a point $z \in [0,1)^{ds}$, applies Owen's scrambling of Section 2.4.1 to $z$ to obtain a point $w = z_\Pi \in [0,1)^{ds}$ and then uses the transformation $\mathscr{D}_d$ to obtain the point $y = \mathscr{D}_d(w) \in [0,1)^s$.

Assuming that the permutations are all chosen with equal probability, then the point $y$ is uniformly distributed in $[0,1)^s$.

Proposition 2 Let $x \in [0,1)^s$ and let $\Pi$ be a uniformly and i.i.d. chosen set of permutations. Then $\mathscr{D}_d((\mathscr{D}_d^{-1}(x))_\Pi)$ is uniformly distributed in $[0,1)^s$, that is, for any Lebesgue measurable set $G \subseteq [0,1)^s$ the probability that $\mathscr{D}_d((\mathscr{D}_d^{-1}(x))_\Pi) \in G$, denoted by $\mathrm{Prob}[\mathscr{D}_d((\mathscr{D}_d^{-1}(x))_\Pi) \in G]$, satisfies $\mathrm{Prob}[\mathscr{D}_d((\mathscr{D}_d^{-1}(x))_\Pi) \in G] = \lambda_s(G)$, where $\lambda_s$ denotes the $s$-dimensional Lebesgue measure.

This result follows along the same lines as the proof of [ISl Proposition 2] . 
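The three steps of scrambling of order $d$ (de-interlace, scramble each of the $d$ components, interlace again) can be sketched for $s = 1$ as follows. This is a hedged illustration: all function names are hypothetical, the expansions are truncated to `n` digits per component, and the permutation sets are plain dictionaries filled lazily, which is an implementation choice of this sketch and not something prescribed by the text.

```python
import random


def to_digits(x, b, n):
    """First n base-b digits of x in [0, 1)."""
    out = []
    for _ in range(n):
        x *= b
        q = int(x)
        out.append(q)
        x -= q
    return out


def from_digits(dig, b):
    return sum(q / b ** (k + 1) for k, q in enumerate(dig))


def deinterlace(x, d, b=2, n=8):
    """Inverse of digit interlacing on the first d*n digits: one number -> d numbers."""
    dig = to_digits(x, b, d * n)
    return [from_digits(dig[r::d], b) for r in range(d)]


def interlace(zs, b=2, n=8):
    """Digit interlacing of d numbers, using n digits of each."""
    d = len(zs)
    cols = [to_digits(z, b, n) for z in zs]
    return from_digits([cols[r][a] for a in range(n) for r in range(d)], b)


def nested_scramble(x, perms, rng, b=2, n=8):
    """Owen's scrambling of one coordinate; `perms` caches permutations by digit prefix."""
    xi = to_digits(x, b, n)
    eta = []
    for k in range(n):
        key = tuple(xi[:k])
        if key not in perms:
            perms[key] = rng.sample(range(b), b)   # random permutation of {0,...,b-1}
        eta.append(perms[key][xi[k]])
    return from_digits(eta, b)


def scramble_order_d(x, perm_sets, rng, b=2, n=8):
    """De-interlace x into d components, scramble each one, and interlace again."""
    d = len(perm_sets)
    z = deinterlace(x, d, b, n)
    w = [nested_scramble(z[r], perm_sets[r], rng, b, n) for r in range(d)]
    return interlace(w, b, n)


perm_sets, rng = [{}, {}], random.Random(0)
print(sorted(scramble_order_d(j / 16, perm_sets, rng, n=2) for j in range(16)))
```

With $d = 2$ and two digits per component, the sixteen grid points $j/16$ are permuted among themselves, which is the finite-precision counterpart of the uniformity statement in Proposition 2.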

2.4.3 Owen's lemma of order d 

A key result on scrambled nets is Owen's lemma (see [16]) which we now generalize to include 
the case of scrambling of order d. Let A; G N have base h representation k = KQ + K,ib+- ■ ■ + k,J)"'- 
For < r < (i let 

kr = KrF + Kr+db'~^'^ + ■■■ + t^aX^ 
11 



where a^ < a is the largest integer such that d divides a^ — r. li a < r we set kr = 0. 

For X = ^ib~^ + ^2^"^ H and a;' = ^^b'^ + ^'^h''^ H and for < r < rf let I3r be the 

largest integer such that ^r = C^^r+d = C+d^ ■ ■ • >^r+/3,d = C+p^d^ and ^r+{f3r+i)d 7^ C+(/3,+i)d- 

Lemma 4 Let y, y' G [0, 1) be two points obtained by applying Owen's scrambling algorithm 
of order d > 1 to the points x, x' G [0, 1). 

(z) Ifk^k', then 



6waU(|/)fewaW(y') 



0. 



1 Y^fe™-! 



E 

(a) If k = k' and there exists an < r < d such that kr > b^'^^^, then 

E[bwa\k{yey')] = 0. 

(Hi) If k = k' and kr < 6^''+^ for <r < d, then 

E[twah{yey')] = {l-b)-\ 

where 

v=\{0<r <d:b^^ <kr< 6^'+^}| . 

The proof of this result follows immediately from [3, Lemma 13.23]. 

In the next section we analyze the variance of the estimator $\hat{I}(f) = \frac{1}{b^m} \sum_{n=0}^{b^m-1} f(y_n)$.

3 Variance of the estimator 

Let $f \in L_2([0,1]^s)$ have the following Walsh series expansion

$$f(x) \sim \sum_{k \in \mathbb{N}_0^s} \hat{f}(k)\, {}_b\mathrm{wal}_k(x) =: S(x, f). \qquad (2)$$

Although we do not necessarily have equality in (2), the completeness of the Walsh function system $\{{}_b\mathrm{wal}_k : k \in \mathbb{N}_0^s\}$ (see [?]) implies that we do have

$$\mathrm{Var}[f] = \sum_{k \in \mathbb{N}_0^s \setminus \{0\}} |\hat{f}(k)|^2 = \mathrm{Var}[S(\cdot, f)]. \qquad (3)$$

We estimate the integral $\int_{[0,1]^s} f(x)\,dx$ by

$$\hat{I}(f) = \frac{1}{b^m} \sum_{n=0}^{b^m-1} f(y_n),$$

where $y_0, \ldots, y_{b^m-1} \in [0,1)^s$ is obtained by applying a random Owen scrambling of order $d$ to the digital $(t, m, s)$-net $P_{b^m} = \{x_0, \ldots, x_{b^m-1}\}$ (below we shall assume that there is a digital $(t, m, ds)$-net $\{z_0, \ldots, z_{b^m-1}\}$ such that $x_n = \mathscr{D}_d(z_n)$ for $0 \le n < b^m$, but for now the assumption that $P_{b^m}$ is a digital $(t, m, s)$-net is sufficient). From Proposition 2 it follows that

$$\mathbb{E}[\hat{I}(f)] = \int_{[0,1]^s} f(x)\,dx.$$

Hence in the following we consider the variance of the estimator $\hat{I}(f)$ denoted by

$$\mathrm{Var}[\hat{I}(f)] = \mathbb{E}[(\hat{I}(f) - \mathbb{E}[\hat{I}(f)])^2].$$



ds 



The following notation is needed for the lemma below. Let d > 1 and I = (Zi, . . . , Z^) G Ng , 
where k = (/{i-i)d+i, . . . , hd)- Let 

Bd,i,s = {{ku • • • , kds) G M^^ : [b'^-'\ <h< h'^ for 1 < ^ < ds]. 

We set 

Consider s = 1 for a moment. Let I G Nq. Then LemmaHimplies that for (fci, . . . , fc^), (/c'^, . . . , /c^) G 
Brf ; 1 we have 



E 



bwal(fc,,...,fc,)(^^ ^(x)n) fewal(fc,,...,fc^)(^^ ^(a;')n) 



E 



bwal(fc'^,...,fc;^)(^^ ^(a;)n) b^ii\k'„...M'A^d \^')ii) 
Hence, for s > 1 and I G Nq^, choose an arbitrary k G Bd^i^s, and set 

6™-l s 



(4) 



i'=0 i=l 



LdXA™) = 7^ XI n^ b^^\k,^,-i)+i,-,k,d^d \xn,i)n) bwa\k,^^_^^+„...,k,d^d \xn',^)^) 



Equation Q implies that this definition is independent of the particular choice of fc G Bd^i,. 
We call Td,i{Pb"^) the gain coefficient (of Pbm)(of order d). 

Lemma 5 Let $d \ge 1$. Let $f \in L_2([0,1]^s)$ and

$$\hat{I}(f) = \frac{1}{b^m} \sum_{n=0}^{b^m-1} f(y_n),$$

where $y_0, \ldots, y_{b^m-1} \in [0,1)^s$ is obtained by applying a random Owen scrambling of order $d$ to the digital net $P_{b^m} = \{x_0, \ldots, x_{b^m-1}\}$. Then

$$\mathrm{Var}[\hat{I}(f)] = \sum_{l \in \mathbb{N}_0^{ds} \setminus \{0\}} \sigma_{d,l,s}^2(f)\, \Gamma_{d,l}(P_{b^m}).$$

Proof. Using the linearity of expectation and Lemma 4 we get

Yl f(^)f(^"li^ Yl bwalfc(2/„)fewaW(?/„,^ 

fe.fc'eNfjVJO} n,n'=0 
6™-l s 



Var[/(/)] 



E 



Y K^)K^')v^ XI W'^\b'^sikXyn,i)b^a[k[{y' 



fc,fe'eNg\{o} 



n,n'=0 i=l 
6^-1 s 



fceNg\{0} n,n'=0 J=l 

n,n'=0 «=1 

ieN^»\{o} 

Hence the result follows. 

To obtain a bound on the variance $\mathrm{Var}[\hat{I}(f)]$ we prove bounds on $\sigma_{d,l,s}(f)$ and $\Gamma_{d,l}(P_{b^m})$, which we consider in the following two subsections.



D 



3.1 A bound on the gain coefficients of order d 

In this section we prove a bound on rd,i{Pb"^), where the point set is a digital {t,m, s)-net as 
constructed in 0. 

Lemma 6 Let {zq, . . . , Z(,m_i} be a digital {t,m, ds)-net over Z^. Let x^ = !^d{zn) for < 
n < b"^. Then the gain coefficients of order d for the digital net Pbm = {xq, . . . , Xb^^i} satisfy 

if \l\i ^ fn — t, 

Td,i{Pb^) < { ^'^'~'''' ^fm-t<\l\l<m-t+\q\, 
Proof. Let $k = (k_1, \ldots, k_{ds})$ and $l = (l_g, 0)$ for some $g \subseteq \{1, \ldots, s\}$. Then from the proof of
[7[ Corollary 13.7] and [TJ Lemma 13.8] it follows that 



6^-1 s 



Td,i{Pb^) = ^ Xl n^['''^^W-i)+iv---fc<i.)(^/(^n,On)fewal(fc/^^_^^^^,...,fc^^)(^/(x„.,i)n) 



n,n'=0 i=l 
6™-! s 



^ ^ niE I fewalfc(2„)n)bwalfc(zn')n) 



ji,n'=0 i=l 

if |Z|i < m, — t, 

5kl-|«|i if m-t < |/|i < m-t+ |g|, 

6^*"+* if |/|i >m-t+ |g|. 

Hence the result follows. □ 

3.2 Higher order variation 

In this subsection we state a bound on $\sigma_{d,l,s}(f)$. The rate of decay of $\sigma_{d,l,s}(f)$ depends on the smoothness of the function $f$. We measure the smoothness using a variation based on finite differences, which we introduce in the following. Since the smoothness of the function $f$ may be unknown, we cannot assume that we can choose $d$ to be the smoothness. Hence, in the following we use $\alpha$ to denote the smoothness of the integrand $f$.

3.2.1 Finite differences 

We use a slight variation from classical finite differences. Let $f : [0,1] \to \mathbb{R}$ and let $z_1, z_2, \ldots \in (-1, 1)$ be a sequence of numbers. Then we define $\Delta_0(x) f = f(x)$ and for $\alpha \ge 1$ we set

$$\Delta_\alpha(x; z_1, \ldots, z_\alpha) f = \Delta_{\alpha-1}(x + z_\alpha; z_1, \ldots, z_{\alpha-1}) f - \Delta_{\alpha-1}(x; z_1, \ldots, z_{\alpha-1}) f.$$

For instance, we have

$$\Delta_1(x; z_1) f = f(x + z_1) - f(x),$$
$$\Delta_2(x; z_1, z_2) f = f(x + z_1 + z_2) - f(x + z_2) - f(x + z_1) + f(x),$$

and in general

$$\Delta_\alpha(x; z_1, \ldots, z_\alpha) f = \sum_{v \subseteq \{1, \ldots, \alpha\}} (-1)^{\alpha - |v|} f\Big(x + \sum_{i \in v} z_i\Big),$$

where $|v|$ denotes the number of elements in $v$. We always assume that $x + \sum_{i \in v} z_i \in [0,1]$ for all $v \subseteq \{1, \ldots, \alpha\}$.
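The recursive definition and the subset-sum form of the difference operator can be compared numerically. The sketch below is an illustration with hypothetical function names; the sign convention $(-1)^{\alpha - |v|}$ in the subset sum is the one that reproduces the examples for $\alpha = 1$ and $\alpha = 2$ given above.

```python
from itertools import combinations


def delta(f, x, zs):
    """Recursive definition of the difference operator with increments zs = (z_1, ..., z_alpha)."""
    if not zs:
        return f(x)                       # order 0: the function value itself
    *rest, last = zs
    return delta(f, x + last, rest) - delta(f, x, rest)


def delta_subsets(f, x, zs):
    """The same quantity written as a signed sum over all subsets of the increments."""
    alpha = len(zs)
    total = 0.0
    for size in range(alpha + 1):
        for v in combinations(zs, size):
            total += (-1) ** (alpha - size) * f(x + sum(v))
    return total


square = lambda x: x * x
cube = lambda x: x ** 3
print(delta(square, 0.1, [0.2, 0.3]), delta_subsets(square, 0.1, [0.2, 0.3]))
print(delta(cube, 0.1, [0.1, 0.2, 0.3]))
```

For $f(x) = x^2$ the second difference equals $2 z_1 z_2$ and for $f(x) = x^3$ the third difference equals $6 z_1 z_2 z_3$, independently of $x$, in line with the mean value representation $z_1 \cdots z_\alpha f^{(\alpha)}(\zeta_\alpha)$.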

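If $f$ is $\alpha$ times continuously differentiable, then the mean value theorem implies that

$$\Delta_\alpha(x; z_1, \ldots, z_\alpha) f = z_\alpha\, \Delta_{\alpha-1}(\zeta_1; z_1, \ldots, z_{\alpha-1}) \frac{\mathrm{d}f}{\mathrm{d}x},$$

where $\min(x, x + z_\alpha) \le \zeta_1 \le \max(x, x + z_\alpha)$. By induction, it then follows that

$$\Delta_\alpha(x; z_1, \ldots, z_\alpha) f = z_1 \cdots z_\alpha\, \frac{\mathrm{d}^\alpha f}{\mathrm{d}x^\alpha}(\zeta_\alpha),$$

where

$$x + \min_{v \subseteq \{1, \ldots, \alpha\}} \sum_{i \in v} z_i \le \zeta_\alpha \le x + \max_{v \subseteq \{1, \ldots, \alpha\}} \sum_{i \in v} z_i.$$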

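We generalize the difference operator to functions $f : [0,1]^s \to \mathbb{R}$. Let $\alpha \ge 0$ be a nonnegative integer. Let $\Delta_{i,\alpha}$ be the one-dimensional difference operator $\Delta_\alpha$ applied to the $i$th coordinate of $f$. For $\boldsymbol{\alpha} = (\alpha_1, \ldots, \alpha_s) \in \{0, \ldots, \alpha\}^s$ and $1 \le i \le s$ let $z_{i,1}, \ldots, z_{i,\alpha_i} \in (-1, 1)$. Then we define

$$\Delta_{\boldsymbol{\alpha}}(x; (z_{1,1}, \ldots, z_{1,\alpha_1}), \ldots, (z_{s,1}, \ldots, z_{s,\alpha_s})) f = \Delta_{1,\alpha_1}(x_1; z_{1,1}, \ldots, z_{1,\alpha_1}) \cdots \Delta_{s,\alpha_s}(x_s; z_{s,1}, \ldots, z_{s,\alpha_s}) f$$
$$= \sum_{v_1 \subseteq \{1, \ldots, \alpha_1\}} \cdots \sum_{v_s \subseteq \{1, \ldots, \alpha_s\}} (-1)^{\sum_{i=1}^{s} (\alpha_i - |v_i|)} f\Big(x_1 + \sum_{i_1 \in v_1} z_{1,i_1}, \ldots, x_s + \sum_{i_s \in v_s} z_{s,i_s}\Big).$$

If $f$ has continuous mixed partial derivatives up to order $\alpha$ in each variable, then, as for the one-dimensional case, we have

$$\Delta_{\boldsymbol{\alpha}}(x; (z_{1,1}, \ldots, z_{1,\alpha_1}), \ldots, (z_{s,1}, \ldots, z_{s,\alpha_s})) f = \prod_{i=1}^{s} \prod_{r_i=1}^{\alpha_i} z_{i,r_i}\, \frac{\partial^{\alpha_1 + \cdots + \alpha_s} f}{\partial x_1^{\alpha_1} \cdots \partial x_s^{\alpha_s}}(\zeta_{1,\alpha_1}, \ldots, \zeta_{s,\alpha_s}), \qquad (5)$$

where we set $\prod_{r_i=1}^{\alpha_i} z_{i,r_i} = 1$ for $\alpha_i = 0$ and where

$$x_i + \min_{v \subseteq \{1, \ldots, \alpha_i\}} \sum_{r \in v} z_{i,r} \le \zeta_{i,\alpha_i} \le x_i + \max_{v \subseteq \{1, \ldots, \alpha_i\}} \sum_{r \in v} z_{i,r}$$

for $1 \le i \le s$. Again we assume that $x_i + \sum_{r \in v} z_{i,r} \in [0,1]$ for all $v \subseteq \{1, \ldots, \alpha_i\}$, so that $\zeta_{i,\alpha_i} \in [0,1]$, for all $0 \le \alpha_i \le \alpha$ and $1 \le i \le s$.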
3.2.2 Variation



Let / : [0,1]^ 



-)■ 



and a > be a nonnegative integer. Let J = Y['i=i[ 



as iai_ aj+l 



), with 



< ttj < 6'' and /,/ G N for 1 < i < as. Apart from at most a countable number of points, the 



set &a{J) is the product of a union of intervals. Let a 
define the generalized Vitali variation by 



«i, 



, a« 



G {1, . . . , aY . Then we 



vLHf) 




Vol(^«(J))sup 



/\n,{t\Z^ 



Zs)f 



m=iU 






1/2 



(6) 



where the first supremum supp is extended over all partitions of [0, 1)"'^ into subcubes of 
the form J = nr=i[f7' ^) "^^^^ < a^ < \}' and Zj G N for 1 < i < as, and the second 
supremum is taken over all t G 2ia{J) and Zi = {zi^i, . . . , Zi^aJ with Zi^r = Ti^rb~°'^''^~^^~'^ where 
Tj r G {1 — b, . . . ,b — 1} \ {0} for 1 < r < a, and 1 < "i < s and such that all the points at 
which / is evaluated in Aa{t; Zi, . . . ,2^) are in ^aiY['i=i[b~^'~^^['^i/b\,b^^''^^{[ai/b\ + 1)). 

In Appendix A it is shown that Vol(^a(-^)) = Vol(J), the volume (i.e. Lebesgue measure) 
of J. Hence, if the partial derivative „ Li „ If are continuous for a given (ai,...,^^) G 

{1, . . . , a}'^, then it can be shown that ([s]) and the mean value theorem imply that the sum ^ 
is a Riemann sum for the integral 



vi'Hf) 




gai-\ has f 

dx'^' ■ ■ ■ dxf 



\x 




For 7^ u C {l,...,s}, let \u\ denote the number of elements in the set u and let 



^aT {fu]u) be the generalized Vitali variation with coefficient q;„ G {1, 
dimensional function 



a 



}l"l of the 



\u\ 



Ju\'^u) 



[o,ip 



f{x)([X{i^,„^s}\u- 



For M = we have /0 = L -^,^ f{x) dx and we define Va (h; 0) = \h 
Then 



1/2 



Vaif) 



^uC{l,...,s} ae{l,...,a}\^\ 

is called the generalized Hardy and Krause variation of / of order a. A function / for which 
Va{f) is finite is said to be of bounded variation (of order a). 
If the partial derivatives $\frac{\partial^{\alpha_1 + \cdots + \alpha_s} f}{\partial x_1^{\alpha_1} \cdots \partial x_s^{\alpha_s}}$ are continuous for all $(\alpha_1, \ldots, \alpha_s) \in \{0, \ldots, \alpha\}^s$, then the variation coincides with the norm



VM) 



/ o \ 1/2 

77^.1 _. ,,„,^fo,ill"i ^fo,il=-i«l 1L;p„c'3;j 



\uC{l,...,s}ae{l,...,a}l"l ^°'"^1' 



dXu 



3.2.3 The decay of the Walsh coefficients for functions of bounded variation 

The following lemma gives a bound on $\sigma_{d,l,s}(f)$ for functions $f$ of bounded variation of order $\alpha$.



Lemma 7 Let a,d E N. Let f : [0, 1]* — ;■ M with V^(/) < oo. Let b > 2 be an integer. Let 
1 = {li,...,lds) eN^' and let K = {i e {l,...,t/s} : k > 0}. Let K, = Kn{{i-l)d+l, . . . ,id} 
andai = min(a, \Ki\) for I <i < s. Let^j = (5 — l)5~J'+(*"i)'^"('3"i)'^ forj E Ki and 1 <i < s. 
Let 7j,i < 7i,2 < • ■ • < 1i,a, for 1 < i < s be such that {-fi^i, . . . ,7i,aJ = {ij '■ J ^ Ki}, that 
is, {'jij : 1 < j < ttj} is just a reordering of the elements of the set {jj : j E Ki}. Set 
7(0 = rc=in;:i7M- Then 

ad,iAf) < r^^'-^'-^'^hmaif). 

The proof of this result is technical and is therefore deferred to Appendix B. 

3.3 Convergence rate 

We can now use Lemmas 5, 6 and 7 to prove the main result of the paper.

Theorem 8 Let $\alpha, d \in \mathbb{N}$. Let $f : [0,1]^s \to \mathbb{R}$ satisfy $V_\alpha(f) < \infty$. Let

$$\hat{I}(f) = \frac{1}{b^m} \sum_{n=0}^{b^m-1} f(y_n),$$

where $y_0, \ldots, y_{b^m-1} \in [0,1)^s$ with $y_n = \mathscr{D}_d((x_n)_\Pi)$ and $x_0, \ldots, x_{b^m-1} \in [0,1)^{ds}$ is a digital $(t, m, ds)$-net and the permutations in $\Pi$ are chosen uniformly and i.i.d. Then

$$\mathrm{Var}[\hat{I}(f)] \le C_{b,s,\alpha}\, V_\alpha^2(f)\, \frac{(m - t + 2)^{\min(\alpha,d)s + s}}{b^{(2\min(\alpha,d)+1)(m-t)}},$$

where $C_{b,s,\alpha} > 0$ is a constant which depends only on $\alpha, b, d, s$, but not on $m$.
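As a reading aid, the bound can be restated in terms of the number of points $N = b^m$. The following short derivation is a sketch which assumes that the inequality of Theorem 8 has the form $\mathrm{Var}[\hat{I}(f)] \le C_{b,s,\alpha} V_\alpha^2(f) (m-t+2)^{\min(\alpha,d)s+s} / b^{(2\min(\alpha,d)+1)(m-t)}$; the constant $C'$ is introduced here only for illustration.

$$b^{-(2\min(\alpha,d)+1)(m-t)} = b^{(2\min(\alpha,d)+1)t}\, N^{-2\min(\alpha,d)-1}, \qquad m - t + 2 \le \log_b N + 2,$$

$$\sqrt{\mathrm{Var}[\hat{I}(f)]} \le C'\, V_\alpha(f)\, N^{-\min(\alpha,d)-1/2}\, (\log_b N + 2)^{(\min(\alpha,d)s+s)/2}, \qquad C' = \sqrt{C_{b,s,\alpha}}\; b^{(\min(\alpha,d)+1/2)t}.$$

Since $(\min(\alpha,d)s + s)/2 = s \min(\alpha+1, d+1)/2$, this is the rate $N^{-\min(\alpha,d)-1/2}$ up to a power of $\log N$, and choosing $d \ge \alpha$ gives the rate $N^{-\alpha-1/2+\varepsilon}$ for any $\varepsilon > 0$.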



Proof. Let d < a. Then from Lemmas ^ p| H and the fact that V^(/) < Ki(/) we obtain 
that 

Var[/(/)] < K.(/)(6-l)2'^'^6^+'^('^-i)r(™-*+^) ^ fe-^'^l'l^ 

«eN,'5M«|i>m-t 



< \4(/)(&-l)^'^''6'+'^^'^ 



-l)L-(m-t+l) Y^ L-2dfe fk + ds - l\ 

^ \ ds-l 



< V (f)(b — l)2<is/^2d _ -|^N-a!s^2d2s+6+d(d-l)^-(2d+l)(m-t+l) /^"^ t + US 



where we used ^21 Lemma 13.24]. Since 

4- ds) ■ ■ -(m, -t-\-2) 

<(m-t + 2 



m ~t + ds\ {m — t + ds) ■ ■ ■ {m — t + 2) \ds-i 



ds-l J (ds-l) 1 

we obtain 

Var[/(/)] < a,,,,,,K.(/)r(2'^+i)("^-*)(m - t + 2)'^^-\ 

for some constant Ca,^d,s > which depends only on a, b, d, s. 

Let now d > a. In the following we sum over all I = {li,...,ls) G Nq*, where li = 

{l{i^i)d+i, ■■■, hd), and such that Zi H \- Ids > m - t. Let l[i_i)a^i > l[i-i)d+2 > ■ ■ ■ > C 

be such that {l'u_i\^j^i, ■ ■ ■ , l'^^} = {l(i-i)d+i, • • • , hd}, that is, the l[ are just a reordering of the 
elements h. There are at most (rf!)* reorderings which yield the same l\, . . . ,1'^. Then we have 

\\li,3 <ip- l)°»6('>'-l)+("'-2)+-+{d-a,) TT^-< ^ ^^ _ ^^)a^d(d-l)/2^-dE;ii''(._i),+,._ 

Hence we have 

Var[/(/)] < K.(/)4^('^-°)(6-l)2°6^+'^('^-i)(rf!)^r(™-*+i) ^ ^-2dE-=iE"=i'(.-i)d+,^ (7) 

t ordered 

where I = {h, . . . , Ids) ordered means that l(i-i)d+i > ■ ■ ■ > hd ioi 1 < i < s. Hence we have 

J s a 

m-t <h^ \-lds< -"^"^ l{i-i)d+j- 

" i=i i=i 
Let now $k_i = l_{(i-1)d+1} + \cdots + l_{(i-1)d+\alpha}$. Then $k_i \ge \alpha\, l_{(i-1)d+j}$ for $\alpha < j \le d$ and $k_1 + \cdots + k_s > \alpha(m - t)/d$. Hence



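$$
\begin{aligned}
\sum_{\substack{\boldsymbol{l} \in \mathbb{N}_0^{ds},\ |\boldsymbol{l}|_1 > m-t \\ \boldsymbol{l}\ \text{ordered}}} b^{-2d \sum_{i=1}^{s} \sum_{j=1}^{\alpha} l_{(i-1)d+j}}
&\le \sum_{\substack{k_1, \ldots, k_s \in \mathbb{N}_0 \\ k_1 + \cdots + k_s > \alpha(m-t)/d}} \prod_{i=1}^{s} b^{-2dk_i} \binom{k_i + \alpha - 1}{\alpha - 1} \left(\frac{k_i}{\alpha} + 1\right)^{d-\alpha} \\
&\le \sum_{\substack{p_1, \ldots, p_s \in \mathbb{N}_0 \\ p_1 + \cdots + p_s > \alpha(m-t)}} b^{-2(p_1 + \cdots + p_s)} \prod_{i=1}^{s} \binom{\lceil p_i/d \rceil + \alpha - 1}{\alpha - 1} \left(\frac{p_i}{\alpha d} + 1\right)^{d-\alpha} \\
&\le \sum_{\substack{p_1, \ldots, p_s \in \mathbb{N}_0 \\ p_1 + \cdots + p_s > \alpha(m-t)}} b^{-2(p_1 + \cdots + p_s)} \prod_{i=1}^{s} \left(\frac{p_i}{d} + 2\right)^{\alpha-1} \left(\frac{p_i}{\alpha d} + 2\right)^{d-\alpha} \\
&\le \sum_{p=\alpha(m-t)+1}^{\infty} b^{-2p} (p+2)^{sd} \binom{p+s-1}{s-1} \\
&\le b^{-2\alpha(m-t)} (\alpha(m-t)+2)^{sd+s} (s(d+1)-1) \\
&\qquad \times \max\!\left(1,\ (s(d+1)-1)^{s(d+1)-1} (\alpha(m-t)+1)^{-(s(d+1)-1)} (\log b)^{-(s(d+1)-1)}\right).
\end{aligned}
$$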

Thus the result follows from (7). □
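The case $d > \alpha$ of the proof relies on the observation that, within a block ordered in non-increasing order, the sum of all $d$ entries is at most $d/\alpha$ times the sum of the $\alpha$ largest entries. The following brute-force sketch checks this for small parameters; the function name and the search range are illustrative assumptions.

```python
from itertools import product


def ordered_block_bound_holds(d, alpha, max_entry):
    """Check alpha * sum(block) <= d * sum(block[:alpha]) for every
    non-increasing block (l_1 >= ... >= l_d) with entries in 0..max_entry."""
    for block in product(range(max_entry + 1), repeat=d):
        # Skip blocks which are not in non-increasing order.
        if any(block[j] < block[j + 1] for j in range(d - 1)):
            continue
        if alpha * sum(block) > d * sum(block[:alpha]):
            return False
    return True


if __name__ == "__main__":
    print(ordered_block_bound_holds(d=4, alpha=2, max_entry=4))
    # Without the ordering the inequality can fail, e.g. d = 3, alpha = 1:
    block = (0, 0, 5)
    print(1 * sum(block) <= 3 * sum(block[:1]))
```

The second print illustrates why the sum in (7) is restricted to ordered $\boldsymbol{l}$.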



4 Discussion 

In this paper we have extended the results of [17, 19] by introducing an algorithm and proving that this algorithm can take advantage of the smoothness $\alpha$ of the integrand, where $\alpha \in \mathbb{N}$ can be arbitrarily large. The main theorem shows that the standard deviation of the estimator $\hat{I}(f)$ converges with order $O(N^{-\min(\alpha,d)-1/2} (\log N)^{s \min(\alpha+1,d+1)/2})$. The numerical results, using some toy examples, also exhibit this rate of convergence. The upper bound is best possible (apart from the power of the $\log N$ factor), since there is also a lower bound on the standard deviation, see [T^ .
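An observed rate of convergence can be read off from measured errors by fitting a line to $\log(\text{RMSE})$ against $\log N$. The sketch below is not taken from the paper: the helper name is hypothetical and the data in the usage example are synthetic, generated from an exact power law purely to illustrate the fit.

```python
import math


def empirical_order(ns, errors):
    """Least-squares slope of log(error) against log(N)."""
    xs = [math.log(n) for n in ns]
    ys = [math.log(e) for e in errors]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    return numerator / denominator


if __name__ == "__main__":
    # Synthetic errors following N^{-3/2} exactly (not measured data).
    ns = [2 ** m for m in range(4, 12)]
    errors = [n ** -1.5 for n in ns]
    print(empirical_order(ns, errors))
```

On real data the logarithmic factor in the bound makes the fitted slope slightly less negative than $-\min(\alpha,d)-1/2$ for moderate $N$.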

The improvement in the rate of convergence in [12] has been obtained by using variance reduction techniques. Conversely, one might now ask whether the methods developed here can be used to obtain new variance reduction techniques. (Some similarities between this approach and antithetic sampling can be found in [B].) This is an open question for future research.

Since the classical scrambling by Owen [15] is computationally too expensive, variations of this scrambling scheme have been introduced which can easily be implemented. Matoušek [TTl \n\ describes an alternative scrambling which uses fewer permutations and is therefore easier to implement, see also [TU]. Another scrambling scheme which can be implemented is by Tezuka and Faure [20]. See also [HI UHl [19] for overviews of various scramblings. The idea is to reduce the number of permutations required such that Owen's lemma still holds. Since the proof of Lemma 4 follows along the same lines as the proof of Owen's lemma, the simplified scramblings mentioned above also apply here.
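To illustrate why the classical scrambling is expensive, the following is a minimal one-dimensional sketch of nested digit scrambling, truncated to finitely many digits. It is an illustration under stated assumptions, not the implementation of [15]: the class and function names are hypothetical, and the permutation applied to a digit is drawn lazily and depends on all preceding (unscrambled) digits.

```python
import random


class NestedScramble:
    """Lazy nested scrambling of base-b digits in one dimension."""

    def __init__(self, b, seed=0):
        self.b = b
        self.rng = random.Random(seed)
        self.perms = {}  # digit prefix (tuple) -> permutation of 0..b-1

    def _perm(self, prefix):
        # Draw the permutation for this prefix on first use.
        if prefix not in self.perms:
            perm = list(range(self.b))
            self.rng.shuffle(perm)
            self.perms[prefix] = perm
        return self.perms[prefix]

    def scramble_digits(self, digits):
        """Permute digit k with a permutation indexed by digits[:k]."""
        return [self._perm(tuple(digits[:k]))[digit]
                for k, digit in enumerate(digits)]


def to_digits(c, b, n):
    """Digits of c / b^n, most significant first."""
    return [(c // b ** (n - 1 - k)) % b for k in range(n)]


def from_digits(digits, b):
    """Inverse of to_digits: returns the integer c."""
    value = 0
    for digit in digits:
        value = value * b + digit
    return value


if __name__ == "__main__":
    b, n = 3, 3
    scr = NestedScramble(b, seed=1)
    images = [from_digits(scr.scramble_digits(to_digits(c, b, n)), b)
              for c in range(b ** n)]
    print(sorted(images) == list(range(b ** n)))  # the grid is permuted
    print(len(scr.perms))  # number of stored permutations
```

For $n$ digits the dictionary holds up to $(b^n - 1)/(b - 1)$ permutations per coordinate, which is the storage cost that the simplified scramblings aim to reduce.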

The only alternative algorithm which achieves the same convergence rate of the RMSE as proven here is based on using an approximation $A(f)$ to the integrand $f$ and then applying MC to $f - A(f)$. The integral is then approximated by $I(f - A(f)) + \int_{[0,1]^s} A(f)(x)\,dx$, where $\int_{[0,1]^s} A(f)(x)\,dx$ can be calculated analytically. See [H |9] for details.
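The idea rests on the decomposition $\int f = \int (f - A(f)) + \int A(f)$: plain MC is applied only to the residual, whose variance is small when $A(f)$ approximates $f$ well. The following one-dimensional sketch uses a hypothetical integrand $f(x) = e^x$ and takes a Taylor polynomial as $A(f)$; both choices are assumptions made for illustration and are not the construction of the cited references.

```python
import math
import random


def f(x):
    """Hypothetical integrand on [0, 1]."""
    return math.exp(x)


def approx(x):
    """Hypothetical approximation A(f): second-order Taylor polynomial."""
    return 1.0 + x + x * x / 2.0


# Integral of the approximation over [0, 1], computed analytically.
APPROX_INTEGRAL = 1.0 + 1.0 / 2.0 + 1.0 / 6.0


def estimate(n, seed=0):
    """MC estimate of the residual integral plus the exact integral of A(f)."""
    rng = random.Random(seed)
    residual_mean = sum(f(x) - approx(x)
                        for x in (rng.random() for _ in range(n))) / n
    return residual_mean + APPROX_INTEGRAL


if __name__ == "__main__":
    print(estimate(10_000), "vs exact", math.e - 1.0)
```

The estimator remains unbiased for any choice of $A(f)$; only its variance depends on the quality of the approximation.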

5 Appendix A: Properties of the digit interlacing function

The digit interlacing function has several properties which we investigate in the following and 
which we use below. 

Lemma 9 Let $d > 1$. Then the mapping $\mathscr{D}_d : [0,1)^{ds} \to [0,1)^s$ is injective but not surjective.

Proof. It suffices to show the result for $s = 1$. First note that the digit expansion of $\mathscr{D}_d(x_1, \ldots, x_d)$ is never of the form $c_1 b^{-1} + \cdots + c_{l-1} b^{-l+1} + (b-1) b^{-l} + (b-1) b^{-l-d} + (b-1) b^{-l-2d} + \cdots$, since this would imply that there is an $x_{j_0}$, $1 \le j_0 \le d$, which is a $b$-adic rational. But in this case we use the finite digit expansion of $x_{j_0}$ and hence no vector $(x_1, \ldots, x_d)$ gets mapped to this real number. Thus $\mathscr{D}_d$ is not surjective.

To show that $\mathscr{D}_d$ is injective, let $(x_1, \ldots, x_d) \ne (y_1, \ldots, y_d) \in [0,1)^d$. Hence there exists an $1 \le i \le d$ such that $x_i \ne y_i$, and hence there is a $k \ge 1$ such that $x_{i,k} \ne y_{i,k}$, where $x_i = x_{i,1} b^{-1} + x_{i,2} b^{-2} + \cdots$ and $y_i = y_{i,1} b^{-1} + y_{i,2} b^{-2} + \cdots$ (and where we use the finite expansions for $b$-adic rationals). Thus the digit expansions of $\mathscr{D}_d(x_1, \ldots, x_d)$ and $\mathscr{D}_d(y_1, \ldots, y_d)$ differ at least at one digit and hence $\mathscr{D}_d(x_1, \ldots, x_d) \ne \mathscr{D}_d(y_1, \ldots, y_d)$. □

(Notice that a countable number of elements could be excluded from the set $[0,1)^s$ such that $\mathscr{D}_d$ becomes bijective.)
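The digit interlacing map can be sketched for $s = 1$ with finitely many digits. The code below assumes the convention that the $a$-th digit of $x_r$ is placed at position $r + (a-1)d$ of the output, and it truncates every input to $n$ digits; the function names are illustrative, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction


def digits_of(x, b, n):
    """First n base-b digits of x in [0, 1), using the finite expansion."""
    out = []
    for _ in range(n):
        x *= b
        digit = int(x)  # floor, since x >= 0
        out.append(digit)
        x -= digit
    return out


def interlace(xs, b, n):
    """Interlace the first n base-b digits of x_1, ..., x_d.

    Digit a of x_r (both 1-based) lands at position r + (a - 1) * d.
    """
    d = len(xs)
    rows = [digits_of(Fraction(x), b, n) for x in xs]
    value = Fraction(0)
    for a in range(n):
        for r in range(d):
            value += Fraction(rows[r][a], b ** (r + 1 + a * d))
    return value


if __name__ == "__main__":
    # x_1 = 0.10_2 and x_2 = 0.01_2 interlace to 0.1001_2 = 9/16.
    print(interlace([Fraction(1, 2), Fraction(1, 4)], b=2, n=2))
```

On a grid of $b$-adic rationals with $n$ digits, distinct input vectors give distinct outputs, which is the finite-precision counterpart of injectivity.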






Lemma 10 Let $d \ge 1$ and $J = \prod_{i=1}^{ds} [a_i, b_i) \subseteq [0,1)^{ds}$ with $a_i \le b_i$ for $1 \le i \le ds$. Let $\lambda_n$ denote the Lebesgue measure on $\mathbb{R}^n$. Then $\lambda_{ds}(J) = \lambda_s(\mathscr{D}_d(J))$.

Proof. The result is trivial for d = 1. Let now d > 1. 

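Consider $s = 1$. Let $J = \prod_{i=1}^{d} [a_i b^{-\nu_i}, (a_i + 1) b^{-\nu_i})$, where $0 \le a_i < b^{\nu_i}$ is an integer and

$$
\frac{a_i}{b^{\nu_i}} = \frac{a_{i,1}}{b} + \frac{a_{i,2}}{b^2} + \cdots + \frac{a_{i,\nu_i}}{b^{\nu_i}}
$$

for some integers $\nu_i \ge 0$. Let $\boldsymbol{\nu} = (\nu_1, \ldots, \nu_d)$, $|\boldsymbol{\nu}|_\infty = \max_{1 \le i \le d} \nu_i$ and $|\boldsymbol{\nu}|_1 = \nu_1 + \cdots + \nu_d$. Then $\lambda_d(J) = b^{-|\boldsymbol{\nu}|_1}$.

Consider now $\mathscr{D}_d(J)$. Let $0 \le c < b^{d|\boldsymbol{\nu}|_\infty}$ and

$$
\frac{c}{b^{d|\boldsymbol{\nu}|_\infty}} = \frac{c_1}{b} + \cdots + \frac{c_{d|\boldsymbol{\nu}|_\infty}}{b^{d|\boldsymbol{\nu}|_\infty}}
$$

with $c_1, \ldots, c_{d|\boldsymbol{\nu}|_\infty} \in \{0, \ldots, b-1\}$. We have

$$
\mathscr{D}_d(J) = \bigcup_{c} \left[ \frac{c}{b^{d|\boldsymbol{\nu}|_\infty}}, \frac{c+1}{b^{d|\boldsymbol{\nu}|_\infty}} \right),
$$

where the union is over all $c$ with expansion as above and where $c_1, \ldots, c_{d|\boldsymbol{\nu}|_\infty} \in \{0, \ldots, b-1\}$ with the restriction that $a_{i,k} = c_{(k-1)d+i}$ for $1 \le k \le \nu_i$ and $1 \le i \le d$. Hence there are $d|\boldsymbol{\nu}|_\infty - |\boldsymbol{\nu}|_1$ digits $c_j$ free to choose. Therefore

$$
\lambda_1(\mathscr{D}_d(J)) = \lambda_1\!\left( \left[ \frac{c}{b^{d|\boldsymbol{\nu}|_\infty}}, \frac{c+1}{b^{d|\boldsymbol{\nu}|_\infty}} \right) \right) b^{d|\boldsymbol{\nu}|_\infty - |\boldsymbol{\nu}|_1} = b^{-|\boldsymbol{\nu}|_1}.
$$

Therefore the result holds for intervals of the form $J$.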

It follows that the result holds for intervals of the form $J = \prod_{i=1}^{ds} [a_i b^{-\nu_i}, (a_i + 1) b^{-\nu_i})$, since this interval is simply a product of the previously considered intervals.

Let now $J = \prod_{i=1}^{ds} [a_i, b_i) \subseteq [0,1)^{ds}$, with $a_i < b_i$ for $1 \le i \le ds$, be an arbitrary interval. Since this interval can be written as a disjoint union of the elementary intervals used above, the result also holds for these intervals.

Let $\emptyset \ne I \subseteq \{1, \ldots, ds\}$ and $a_i = b_i$ for $i \in I$. Then $\lambda_{ds}(J) = 0$. On the other hand, define

$$
b'_i = \begin{cases} a_i + b^{-\nu} & \text{for } i \in I, \\ b_i & \text{otherwise,} \end{cases}
$$

where $\nu$ is large enough such that $b'_i \le 1$ for all $1 \le i \le ds$. Set $J' = \prod_{i=1}^{ds} [a_i, b'_i)$. Then

$$
0 \le \lambda_s(\mathscr{D}_d(J)) \le \lambda_s(\mathscr{D}_d(J')) = \lambda_{ds}(J') \le b^{-\nu} \to 0 \quad \text{as } \nu \to \infty.
$$

Hence $\lambda_s(\mathscr{D}_d(J)) = 0$. □
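The counting argument for elementary intervals in the case $s = 1$ can be reproduced by enumeration. The sketch below, with hypothetical function names, splits $[0,1)$ into cells of width $b^{-d|\boldsymbol{\nu}|_\infty}$, de-interlaces the digits of each cell, and counts the cells whose components fall into the prescribed elementary interval.

```python
from fractions import Fraction


def deinterlace(c, b, d, n):
    """Split the d*n base-b digits c_1, ..., c_{dn} of c / b^{dn} into d
    integers; component i (1-based) takes the digits c_{(k-1)d+i}, k = 1..n."""
    digits = [(c // b ** (d * n - 1 - p)) % b for p in range(d * n)]
    comps = []
    for i in range(d):
        value = 0
        for k in range(n):
            value = value * b + digits[k * d + i]
        comps.append(value)
    return comps


def image_measure(a, nu, b):
    """Measure of the interlaced image of
    prod_i [a_i b^{-nu_i}, (a_i + 1) b^{-nu_i}), obtained by counting cells."""
    d, n = len(nu), max(nu)
    count = 0
    for c in range(b ** (d * n)):
        comps = deinterlace(c, b, d, n)
        # Component i lies in the interval iff its first nu_i digits match a_i.
        if all(comps[i] // b ** (n - nu[i]) == a[i] for i in range(d)):
            count += 1
    return Fraction(count, b ** (d * n))


if __name__ == "__main__":
    # nu = (1, 2): three digits are fixed, one of the four is free.
    print(image_measure(a=[1, 2], nu=[1, 2], b=2))
```

In each case the result equals $b^{-|\boldsymbol{\nu}|_1}$, the Lebesgue measure of the original interval.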




6 Appendix B: Proof of Lemma 7

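Assume first that $d \ge \alpha$. Let $\boldsymbol{l} = (l_1, \ldots, l_{ds}) \in \mathbb{N}_0^{ds}$ and let $K = \{i \in \{1, \ldots, ds\} : l_i > 0\}$. Let $K_i = K \cap \{(i-1)d+1, \ldots, (i-1)d+d\}$. First assume that $K_i \ne \emptyset$ for $i = 1, \ldots, s$.

Let $\boldsymbol{l} - \boldsymbol{1}_K = ((l_1 - 1)_+, \ldots, (l_{ds} - 1)_+) \in \mathbb{N}_0^{ds}$, where $(x)_+ = \max(x, 0)$. Let $A_{\boldsymbol{l}} = \{\boldsymbol{a} = (a_1, \ldots, a_{ds}) \in \mathbb{N}_0^{ds} : 0 \le a_i < b^{l_i} \text{ for } 1 \le i \le ds\}$ and

$$
[\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}) := \prod_{i=1}^{ds} [a_i b^{-l_i}, (a_i + 1) b^{-l_i}).
$$

Let $\boldsymbol{q} = (q_1, \ldots, q_{ds})$, where $q_i = \lfloor a_i/b \rfloor$. In the following we write $[\boldsymbol{q} b^{-\boldsymbol{l}+\boldsymbol{1}_K}, (\boldsymbol{q}+\boldsymbol{1}) b^{-\boldsymbol{l}+\boldsymbol{1}_K})$ for $\prod_{i=1}^{ds} [b^{-l_i+1} \lfloor a_i/b \rfloor, b^{-l_i+1} (\lfloor a_i/b \rfloor + 1))$. Further let

$$
\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}})) = \{\mathscr{D}_d(\boldsymbol{x}) \in [0,1)^s : \boldsymbol{x} \in [\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}})\}.
$$

Let $\boldsymbol{x} \in \mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))$, then

$$
\sum_{\boldsymbol{k} \in A_{\boldsymbol{l}}} \hat{f}(\mathscr{D}_d(\boldsymbol{k}))\, {}_b\mathrm{wal}_{\mathscr{D}_d(\boldsymbol{k})}(\boldsymbol{x}) = \int_{[0,1]^s} f(\boldsymbol{t}) \sum_{\boldsymbol{k} \in A_{\boldsymbol{l}}} {}_b\mathrm{wal}_{\mathscr{D}_d(\boldsymbol{k})}(\boldsymbol{x} \ominus \boldsymbol{t})\, d\boldsymbol{t} = b^{|\boldsymbol{l}|_1} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} f(\boldsymbol{t})\, d\boldsymbol{t}.
$$

For $\boldsymbol{l} \in \mathbb{N}_0^{ds}$ and $\boldsymbol{a} \in A_{\boldsymbol{l}}$ let

$$
c_{\boldsymbol{l},\boldsymbol{a}} = \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} f(\boldsymbol{t})\, d\boldsymbol{t}.
$$

For $\boldsymbol{x} \in \mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))$ let

$$
\begin{aligned}
g(\boldsymbol{x}) &:= \sum_{u \subseteq K} (-1)^{|u|} \sum_{\boldsymbol{k} \in A_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0})}} \hat{f}(\mathscr{D}_d(\boldsymbol{k}))\, {}_b\mathrm{wal}_{\mathscr{D}_d(\boldsymbol{k})}(\boldsymbol{x}) \\
&= \sum_{u \subseteq K} (-1)^{|u|}\, b^{|\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0})|_1}\, c_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0}),\,(\lfloor \boldsymbol{a}_u/b \rfloor,\, \boldsymbol{a}_{\{1,\ldots,ds\} \setminus u})},
\end{aligned}
$$

where $(\lfloor \boldsymbol{a}_u/b \rfloor, \boldsymbol{a}_{\{1,\ldots,ds\} \setminus u})$ is the vector whose $i$th coordinate is $\lfloor a_i/b \rfloor$ if $i \in u$ and $a_i$ if $i \in \{1, \ldots, ds\} \setminus u$.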

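Using Plancherel's identity we obtain

$$
\begin{aligned}
\sigma^2_{\boldsymbol{l},d,s}(f) &= \sum_{u \subseteq K} (-1)^{|u|} \sum_{\boldsymbol{k} \in A_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0})}} |\hat{f}(\mathscr{D}_d(\boldsymbol{k}))|^2 = \int_{[0,1]^s} |g(\boldsymbol{x})|^2\, d\boldsymbol{x} \\
&= b^{-|\boldsymbol{l}|_1} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} \Big| \sum_{u \subseteq K} (-1)^{|u|}\, b^{|\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0})|_1}\, c_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0}),\,(\lfloor \boldsymbol{a}_u/b \rfloor,\, \boldsymbol{a}_{\{1,\ldots,ds\} \setminus u})} \Big|^2 \\
&= b^{|\boldsymbol{l}|_1} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} \Big| \sum_{u \subseteq K} (-1)^{|u|}\, b^{-|u|}\, c_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0}),\,(\lfloor \boldsymbol{a}_u/b \rfloor,\, \boldsymbol{a}_{\{1,\ldots,ds\} \setminus u})} \Big|^2.
\end{aligned}
$$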



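We can simplify the inner sum further. Let $\boldsymbol{e} = b \lfloor \boldsymbol{a}/b \rfloor$, i.e. the $i$th component of $\boldsymbol{e}$ is given by $e_i = b \lfloor a_i/b \rfloor$. Further let $\boldsymbol{d} = \boldsymbol{a} - \boldsymbol{e}$, i.e. the $i$th component of $\boldsymbol{d}$ is given by $d_i = a_i - e_i$. Then we have

$$
\begin{aligned}
\sum_{u \subseteq K} (-1)^{|u|}\, b^{-|u|}\, c_{\boldsymbol{l}-(\boldsymbol{1}_u,\boldsymbol{0}),\,(\lfloor \boldsymbol{a}_u/b \rfloor,\, \boldsymbol{a}_{\{1,\ldots,ds\} \setminus u})}
&= b^{-|K|} \sum_{\boldsymbol{k} \in A_{\boldsymbol{1}_K}} \sum_{u \subseteq K} (-1)^{|u|}\, c_{\boldsymbol{l},\,\boldsymbol{a}+(\boldsymbol{k}_u-\boldsymbol{d}_u,\, \boldsymbol{0}_{\{1,\ldots,ds\} \setminus u})} \\
&= b^{-|K|} \sum_{\boldsymbol{k} \in A_{\boldsymbol{1}_K}} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} \sum_{u \subseteq K} (-1)^{|u|} f\big(\boldsymbol{t} + \mathscr{D}_d(b^{-\boldsymbol{l}}(\boldsymbol{k}_u - \boldsymbol{d}_u, \boldsymbol{0}_{\{1,\ldots,ds\} \setminus u}))\big)\, d\boldsymbol{t},
\end{aligned}
$$

where we extend the digit interlacing function $\mathscr{D}_d$ to negative values by using digits in $\{1-b, \ldots, 0\}$ in case a component is negative. To shorten the notation we set

$$
\delta_{\boldsymbol{k}}(\boldsymbol{t}) = \sum_{u \subseteq K} (-1)^{|u|} f\big(\boldsymbol{t} + \mathscr{D}_d(b^{-\boldsymbol{l}}(\boldsymbol{k}_u - \boldsymbol{d}_u, \boldsymbol{0}_{\{1,\ldots,ds\} \setminus u}))\big).
$$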

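Therefore

$$
\begin{aligned}
\sigma^2_{\boldsymbol{l},d,s}(f) &= b^{|\boldsymbol{l}|_1 - 2|K|} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} \Big| \sum_{\boldsymbol{k} \in A_{\boldsymbol{1}_K}} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} \delta_{\boldsymbol{k}}(\boldsymbol{t})\, d\boldsymbol{t} \Big|^2 \\
&\le b^{|\boldsymbol{l}|_1 - 2|K|} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} \sum_{\boldsymbol{k}, \boldsymbol{k}' \in A_{\boldsymbol{1}_K}} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}}(\boldsymbol{t})|\, d\boldsymbol{t} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}'}(\boldsymbol{t})|\, d\boldsymbol{t}.
\end{aligned}
$$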



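Using the Cauchy-Schwarz inequality we have

$$
\int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}}(\boldsymbol{t})|\, d\boldsymbol{t} \le \Big( \int 1\, d\boldsymbol{t} \Big)^{1/2} \Big( \int |\delta_{\boldsymbol{k}}(\boldsymbol{t})|^2\, d\boldsymbol{t} \Big)^{1/2} = b^{-|\boldsymbol{l}|_1/2} \Big( \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}}(\boldsymbol{t})|^2\, d\boldsymbol{t} \Big)^{1/2},
$$

where all integrals are over $\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))$.

Let $B_{\boldsymbol{a},\boldsymbol{k}} = \big( \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}}(\boldsymbol{t})|^2\, d\boldsymbol{t} \big)^{1/2}$. Then we have

$$
\sigma^2_{\boldsymbol{l},d,s}(f) \le b^{-2|K|} \sum_{\boldsymbol{k}, \boldsymbol{k}' \in A_{\boldsymbol{1}_K}} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} B_{\boldsymbol{a},\boldsymbol{k}} B_{\boldsymbol{a},\boldsymbol{k}'} \le \max_{\boldsymbol{k}, \boldsymbol{k}' \in A_{\boldsymbol{1}_K}} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} B_{\boldsymbol{a},\boldsymbol{k}} B_{\boldsymbol{a},\boldsymbol{k}'} = \max_{\boldsymbol{k} \in A_{\boldsymbol{1}_K}} \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} B_{\boldsymbol{a},\boldsymbol{k}}^2,
$$

where the last equality follows as the Cauchy-Schwarz inequality is an equality for two vectors which are linearly dependent. Let $\boldsymbol{k}^*$ be the value of $\boldsymbol{k} \in A_{\boldsymbol{1}_K}$ for which the sum $\sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} B_{\boldsymbol{a},\boldsymbol{k}}^2$ takes on its maximum. Hence

$$
\sigma^2_{\boldsymbol{l},d,s}(f) \le \sum_{\boldsymbol{a} \in A_{\boldsymbol{l}}} \int_{\mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))} |\delta_{\boldsymbol{k}^*}(\boldsymbol{t})|^2\, d\boldsymbol{t}.
$$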

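The following lemma relates the function $\delta_{\boldsymbol{k}}$ to the divided differences introduced above.

Lemma 11 Let $\boldsymbol{l}, \boldsymbol{a}, \boldsymbol{e}, \boldsymbol{q}, K$ and $K_1, \ldots, K_s$ be defined as above. For $\boldsymbol{t} \in \mathscr{D}_d([\boldsymbol{a} b^{-\boldsymbol{l}}, (\boldsymbol{a}+\boldsymbol{1}) b^{-\boldsymbol{l}}))$ we have

$$
|\delta_{\boldsymbol{k}^*}(\boldsymbol{t})| \le 2^{s(d-\alpha)} \sup |\Delta_{\boldsymbol{\alpha}}(\boldsymbol{t}'; \boldsymbol{z}_1, \ldots, \boldsymbol{z}_s) f|,
$$

where $\boldsymbol{\alpha} = (\alpha_1, \ldots, \alpha_s)$ with $\alpha_i = \min(|K_i|, \alpha)$, and the supremum is taken over all $\boldsymbol{t}' \in$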
&d{[ab~^, {a + 1)6"')) and Zi = (zj,i, . . . ,^^,0,) with Zi^r, = n.^fi''^'^^''''^'''' where n^r, e {1 - 
6, ... ,6 — 1} \ {0} for 1 < Tj < \Ki\ and 1 < i < s and such that all the points at which f 
is evaluated in Aa{t'; Zi, . . . , Zg) are in ^Q,([g6~'^''"^, {q + 1)6~'"'"-'-^)). Furthermore, we may 
assume that \zi^i\ < \zi^2\ < ■ ■ ■ < \zi,\Ki\\ for 1 <i < s. 
Proof. We show that $\delta_{\boldsymbol{k}^*}(\boldsymbol{t})$ can be written as divided differences. Since the divided difference operators are applied to each coordinate separately, it suffices to show the result for $s = 1$. In this case we have

$$
\delta_{\boldsymbol{k}^*}(t) = \sum_{u \subseteq K} (-1)^{|u|} f\big(t + \mathscr{D}_d(b^{-\boldsymbol{l}}(\boldsymbol{k}^*_u - \boldsymbol{d}_u, \boldsymbol{0}_{\{1,\ldots,d\} \setminus u}))\big),
$$

where now $K = \{i \in \{1, \ldots, d\} : l_i > 0\}$.

Let I = {h, . . . , /rf). Let t = ^ + |H , a = (ai, . . . , a^) and aj = aj^i^ + aj^i^-ib H h 

ttj^iU^^^. Then for t G ^d([a6"', (a + 1)&"')) "^^ have 

'tj+{i-i)d = o.j,i for 1 < / < Ij and j G K. 

Further we have dj = aj^i- for j G K. Let 

J = {j + (/-l)rf:l</</„jGi^}. 

Then for t G ^d([a6^', (a + 1)6~')) and -u C fi' we have 

jG-ft: /=i ieu jeK\u jen\i 

For given t G ^d([a6"', (a + 1)6"')) let 

r„ = t + ^d{p-\kl - d„, 0{i,...,4\„)). 

Let fc* = (A;*, . . . , /c^) and 

r. = -1 ^ for i G K 

Notice that if Zj = 0, then 6k* (t) = and hence we can exclude this case. Then for v G u 'O K 
we have 

E 

j£u\v 



'u 'V / J Zj. 



Therefore 

5At) = ^(-l)H/(t + ^,(6-'(fc:-d„,0|i,...,4\„))) 

uCK 



uCK 




uQK 

uCK jdu 

= A\K\{t;z')f, 

where z' = {zj)j(zk- 

Notice that the ordering of the elements in $\boldsymbol{z}'$ does not change the value of $\Delta_{|K|}(t; \boldsymbol{z}')$. Hence assume that the elements in $\boldsymbol{z}'$ are ordered such that $z'_1 > z'_2 > \cdots > z'_{|K|}$. For the case where $|K| > \alpha$ we obtain from the definition of the divided differences that

uC{|i^| + l,...,a} V jeu ) 

By taking the triangular inequality and the supremum over all t' in \t + XlieM'^i' • "" — 
WK\ + 1, . . . , a}}, we obtain 

A|x|(t;;zO<2"^^lsup|A«(t';(^;,...,<))|. 
t' 

Consider now the general case $s > 1$ and $K = \{i \in \{1, \ldots, ds\} : l_i > 0\}$. Let $K_i = K \cap \{(i-1)d+1, \ldots, (i-1)d+d\}$ and $\alpha'_i = |K_i|$ for $1 \le i \le s$. Let $\boldsymbol{\alpha}' = (\alpha'_1, \ldots, \alpha'_s)$. Let

k* - ttji. 
Zj = - . . ,^^.// r^j for j e Ki and 1 < i < s. 

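and $\boldsymbol{z}'_i = (z_j)_{j \in K_i}$ for $1 \le i \le s$. Then we obtain

$$
\delta_{\boldsymbol{k}^*}(\boldsymbol{t}) = \Delta_{\boldsymbol{\alpha}'}(\boldsymbol{t}; \boldsymbol{z}'_1, \ldots, \boldsymbol{z}'_s) f.
$$

Define now $\alpha_i = \min(\alpha, \alpha'_i)$ for $1 \le i \le s$ and $\boldsymbol{\alpha} = (\alpha_1, \ldots, \alpha_s)$. Notice that $\alpha'_i \le d$ and therefore

$$
\sum_{i=1}^{s} (\alpha'_i - \alpha_i) \le s(d - \alpha).
$$

Notice that $\Delta_{\boldsymbol{\alpha}'}$ can be expressed as an alternating sum of $2^{|\boldsymbol{\alpha}'|_1 - |\boldsymbol{\alpha}|_1}$ summands $\Delta_{\boldsymbol{\alpha}}$. By the triangle inequality we therefore obtain

$$
|\delta_{\boldsymbol{k}^*}(\boldsymbol{t})| = |\Delta_{\boldsymbol{\alpha}'}(\boldsymbol{t}; \boldsymbol{z}'_1, \ldots, \boldsymbol{z}'_s) f| \le 2^{s(d-\alpha)} \sup |\Delta_{\boldsymbol{\alpha}}(\boldsymbol{t}'; \boldsymbol{z}_1, \ldots, \boldsymbol{z}_s) f|,
$$

where the supremum is taken over all admissible choices of $\boldsymbol{z}_1, \ldots, \boldsymbol{z}_s$ and $\boldsymbol{t}'$. □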
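The alternating sum over subsets which defines $\delta_{\boldsymbol{k}^*}$ is an iterated finite difference, one difference per element of $K$. The following sketch for $s = 1$ illustrates this with undivided differences; the divided differences of the paper are assumed to differ from these by a normalisation in the steps $z_j$, which is not modelled here. The function names and the test functions are illustrative only.

```python
from fractions import Fraction
from itertools import combinations


def alternating_sum(f, t, steps):
    """Sum over all subsets u of steps of (-1)^{|u|} f(t + sum of u)."""
    total = 0
    for size in range(len(steps) + 1):
        for subset in combinations(steps, size):
            total += (-1) ** size * f(t + sum(subset))
    return total


def iterated_difference(f, t, steps):
    """Apply the operator g(t) -> g(t) - g(t + z) once for each step z."""
    if not steps:
        return f(t)
    z, rest = steps[0], steps[1:]
    return iterated_difference(f, t, rest) - iterated_difference(f, t + z, rest)


if __name__ == "__main__":
    steps = [Fraction(1, 2), Fraction(-1, 4), Fraction(1, 8)]
    t = Fraction(1, 3)
    cube = lambda x: x ** 3
    print(alternating_sum(cube, t, steps), iterated_difference(cube, t, steps))
    # A third-order difference annihilates polynomials of degree two.
    print(alternating_sum(lambda x: x * x, t, steps))
```

The vanishing on low-degree polynomials is the mechanism by which smoothness of $f$ translates into small values of $\delta_{\boldsymbol{k}^*}$.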
Hence






where the supremum is over the same set as in Lemma 11. Therefore



<,,,(/) < 2^(^-") Y, Vol(^,([a6-', (a + l)r'])) sup |A«(t'; ^i, . . . , 2,)/p 



aeA, 



< 2^(^-") 5^ Vol(^d([g6-'+i-,(g + l)6-'+i-]))sup|A„(t';2i,...,2,)/p 



'jeA;_i^ 



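Let $\gamma_j = (b-1)\, b^{-j+(i-1)d-(l_j-1)d}$ for $j \in K_i$ and $1 \le i \le s$. Let $\gamma_{i,1} \le \gamma_{i,2} \le \cdots \le \gamma_{i,\alpha'_i}$ for $1 \le i \le s$ be such that $\{\gamma_{i,1}, \ldots, \gamma_{i,\alpha'_i}\} = \{\gamma_j : j \in K_i\}$, that is, $\{\gamma_{i,j} : 1 \le j \le \alpha'_i\}$ is just a reordering of the elements of the set $\{\gamma_j : j \in K_i\}$. Set $\gamma(\boldsymbol{l}) = \prod_{i=1}^{s} \prod_{j=1}^{\alpha_i} \gamma_{i,j}$. Then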

<m(/) < r^'-'-h^i) E Voi(^„([gr'+^-,(g + i)6-'+^-]))sup^^^"--'^^^^'' 

cc4. . 1 lis 



where the supremum is over all admissible $\boldsymbol{t}'$ and $\boldsymbol{z}_1, \ldots, \boldsymbol{z}_s$ as described in the lemma.

Consider now the case where $K_i = \emptyset$ for some $1 \le i \le s$. Let $R = \{i \in \{1, \ldots, s\} : K_i = \emptyset\}$. Then the result follows by replacing $f$ with the function $\int_{[0,1]^{|R|}} f(\boldsymbol{x})\, d\boldsymbol{x}_R$ in the proof above.

Let now $d < \alpha$. Then $V_d(f) \le V_\alpha(f)$, and hence the result follows by using the proof above with $d = \alpha$. This completes the proof.